Yellowhead County
Estimating Blood Pressure with a Camera: An Exploratory Study of Ambulatory Patients with Cardiovascular Disease
Curran, Theodore, Ma, Chengqian, Liu, Xin, McDuff, Daniel, Narayanswamy, Girish, Stergiou, George, Patel, Shwetak, Yang, Eugene
Hypertension is a leading cause of morbidity and mortality worldwide. The ability to diagnose and treat hypertension in the ambulatory population is hindered by limited access and poor adherence to current methods of monitoring blood pressure (BP), specifically, cuff-based devices. Remote photoplethysmography (rPPG) evaluates an individual's pulse waveform through a standard camera without physical contact. Cameras are readily available to the majority of the global population via embedded technologies such as smartphones, thus rPPG is a scalable and promising non-invasive method of BP monitoring. The few studies investigating rPPG for BP measurement have excluded high-risk populations, including those with cardiovascular disease (CVD) or its risk factors, as well as subjects in active cardiac arrhythmia. The impact of arrhythmia, like atrial fibrillation, on the prediction of BP using rPPG is currently uncertain. We performed a study to better understand the relationship between rPPG and BP in a real-world sample of ambulatory patients from a cardiology clinic with established CVD or risk factors for CVD. We collected simultaneous rPPG, PPG, BP, ECG, and other vital signs data from 143 subjects while at rest, and used this data plus demographics to train a deep learning model to predict BP. We report that facial rPPG yields a signal that is comparable to finger PPG. Pulse wave analysis (PWA)-based BP estimates on this cohort performed comparably to studies on healthier subjects, and notably, the accuracy of BP prediction in subjects with atrial fibrillation was not inferior to subjects with normal sinus rhythm. In a binary classification task, the rPPG model identified subjects with systolic BP $\geq$ 130 mm Hg with a positive predictive value of 71% (baseline prevalence 48.3%), highlighting the potential of rPPG for hypertension monitoring.
- North America > United States > California > Orange County > Irvine (0.04)
- North America > Canada > Alberta > Census Division No. 14 > Yellowhead County (0.04)
- Europe > Switzerland > Basel-City > Basel (0.04)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
BRIDGE: Bridging Gaps in Image Captioning Evaluation with Stronger Visual Cues
Sarto, Sara, Cornia, Marcella, Baraldi, Lorenzo, Cucchiara, Rita
Effectively aligning with human judgment when evaluating machine-generated image captions represents a complex yet intriguing challenge. Existing evaluation metrics like CIDEr or CLIP-Score fall short in this regard as they do not take into account the corresponding image or lack the capability of encoding fine-grained details and penalizing hallucinations. To overcome these issues, in this paper, we propose BRIDGE, a new learnable and reference-free image captioning metric that employs a novel module to map visual features into dense vectors and integrates them into multi-modal pseudo-captions which are built during the evaluation process. This approach results in a multimodal metric that properly incorporates information from the input image without relying on reference captions, bridging the gap between human judgment and machine-generated image captions. Experiments spanning several datasets demonstrate that our proposal achieves state-of-the-art results compared to existing reference-free evaluation scores.
- Europe > Italy (0.04)
- Oceania > Australia > Western Australia > North West Shelf (0.04)
- North America > United States > Mississippi (0.04)
- (2 more...)
Skill-Mix: a Flexible and Expandable Family of Evaluations for AI models
Yu, Dingli, Kaur, Simran, Gupta, Arushi, Brown-Cohen, Jonah, Goyal, Anirudh, Arora, Sanjeev
With LLMs shifting their role from statistical modeling of language to serving as general-purpose AI agents, how should LLM evaluations change? Arguably, a key ability of an AI agent is to flexibly combine, as needed, the basic skills it has learned. The capability to combine skills plays an important role in (human) pedagogy and also in a paper on emergence phenomena (Arora & Goyal, 2023). This work introduces Skill-Mix, a new evaluation to measure ability to combine skills. Using a list of $N$ skills the evaluator repeatedly picks random subsets of $k$ skills and asks the LLM to produce text combining that subset of skills. Since the number of subsets grows like $N^k$, for even modest $k$ this evaluation will, with high probability, require the LLM to produce text significantly different from any text in the training set. The paper develops a methodology for (a) designing and administering such an evaluation, and (b) automatic grading (plus spot-checking by humans) of the results using GPT-4 as well as the open LLaMA-2 70B model. Administering a version of to popular chatbots gave results that, while generally in line with prior expectations, contained surprises. Sizeable differences exist among model capabilities that are not captured by their ranking on popular LLM leaderboards ("cramming for the leaderboard"). Furthermore, simple probability calculations indicate that GPT-4's reasonable performance on $k=5$ is suggestive of going beyond "stochastic parrot" behavior (Bender et al., 2021), i.e., it combines skills in ways that it had not seen during training. We sketch how the methodology can lead to a Skill-Mix based eco-system of open evaluations for AI capabilities of future models.
- North America > United States > New York (0.04)
- North America > Canada > Alberta > Census Division No. 14 > Yellowhead County (0.04)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
- (2 more...)
- Research Report (1.00)
- Personal > Interview (0.46)
- Education (1.00)
- Government (0.67)
- Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.46)
Li3DeTr: A LiDAR based 3D Detection Transformer
Erabati, Gopi Krishna, Araujo, Helder
Inspired by recent advances in vision transformers for object detection, we propose Li3DeTr, an end-to-end LiDAR based 3D Detection Transformer for autonomous driving, that inputs LiDAR point clouds and regresses 3D bounding boxes. The LiDAR local and global features are encoded using sparse convolution and multi-scale deformable attention respectively. In the decoder head, firstly, in the novel Li3DeTr cross-attention block, we link the LiDAR global features to 3D predictions leveraging the sparse set of object queries learnt from the data. Secondly, the object query interactions are formulated using multi-head self-attention. Finally, the decoder layer is repeated $L_{dec}$ number of times to refine the object queries. Inspired by DETR, we employ set-to-set loss to train the Li3DeTr network. Without bells and whistles, the Li3DeTr network achieves 61.3% mAP and 67.6% NDS surpassing the state-of-the-art methods with non-maximum suppression (NMS) on the nuScenes dataset and it also achieves competitive performance on the KITTI dataset. We also employ knowledge distillation (KD) using a teacher and student model that slightly improves the performance of our network.
- North America > Canada > Alberta > Census Division No. 14 > Yellowhead County (0.04)
- Europe > Portugal > Coimbra > Coimbra (0.04)
- Education (0.54)
- Information Technology (0.35)
- Transportation > Ground > Road (0.35)
Deep Recursive Embedding for High-Dimensional Data
Zhou, Zixia, Zu, Xinrui, Wang, Yuanyuan, Lelieveldt, Boudewijn P. F., Tao, Qian
Embedding high-dimensional data onto a low-dimensional manifold is of both theoretical and practical value. In this paper, we propose to combine deep neural networks (DNN) with mathematics-guided embedding rules for high-dimensional data embedding. We introduce a generic deep embedding network (DEN) framework, which is able to learn a parametric mapping from high-dimensional space to low-dimensional space, guided by well-established objectives such as Kullback-Leibler (KL) divergence minimization. We further propose a recursive strategy, called deep recursive embedding (DRE), to make use of the latent data representations for boosted embedding performance. We exemplify the flexibility of DRE by different architectures and loss functions, and benchmarked our method against the two most popular embedding methods, namely, t-distributed stochastic neighbor embedding (t-SNE) and uniform manifold approximation and projection (UMAP). The proposed DRE method can map out-of-sample data and scale to extremely large datasets. Experiments on a range of public datasets demonstrated improved embedding performance in terms of local and global structure preservation, compared with other state-of-the-art embedding methods.
- North America > United States > California > San Francisco County > San Francisco (0.14)
- Asia > China > Shanghai > Shanghai (0.04)
- Europe > Netherlands > South Holland > Leiden (0.04)
- (3 more...)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
- Information Technology > Data Science (0.93)
MulViMotion: Shape-aware 3D Myocardial Motion Tracking from Multi-View Cardiac MRI
Meng, Qingjie, Qin, Chen, Bai, Wenjia, Liu, Tianrui, de Marvao, Antonio, O'Regan, Declan P, Rueckert, Daniel
Recovering the 3D motion of the heart from cine cardiac magnetic resonance (CMR) imaging enables the assessment of regional myocardial function and is important for understanding and analyzing cardiovascular disease. However, 3D cardiac motion estimation is challenging because the acquired cine CMR images are usually 2D slices which limit the accurate estimation of through-plane motion. To address this problem, we propose a novel multi-view motion estimation network (MulViMotion), which integrates 2D cine CMR images acquired in short-axis and long-axis planes to learn a consistent 3D motion field of the heart. In the proposed method, a hybrid 2D/3D network is built to generate dense 3D motion fields by learning fused representations from multi-view images. To ensure that the motion estimation is consistent in 3D, a shape regularization module is introduced during training, where shape information from multi-view images is exploited to provide weak supervision to 3D motion estimation. We extensively evaluate the proposed method on 2D cine CMR images from 580 subjects of the UK Biobank study for 3D motion tracking of the left ventricular myocardium. Experimental results show that the proposed method quantitatively and qualitatively outperforms competing methods.
- Europe > United Kingdom (0.48)
- North America > United States (0.04)
- North America > Canada > Alberta > Census Division No. 14 > Yellowhead County (0.04)
- (2 more...)
AI godfather Geoff Hinton: "Deep learning is going to be able to do everything"
The modern AI revolution began during an obscure research contest. It was 2012, the third year of the annual ImageNet competition, which challenged teams to build computer vision systems that would recognize 1,000 objects, from animals to landscapes to people. In the first two years, the best teams had failed to reach even 75% accuracy. But in the third, a band of three researchers--a professor and his students--suddenly blew past this ceiling. They won the competition by a staggering 10.8 percentage points. That professor was Geoffrey Hinton, and the technique they used was called deep learning.
The music moves us -- but how?
Music and dance are so deeply embedded in the human experience that we almost take them for granted. They're distinct from one another, but intimately related: Music -- arrangements of sound over time -- causes us to move our bodies in space. Without knowing it, we track pulse, tempo and rhythm, and we move in response. But only recently have scientists developed the tools, and the inclination, to quantitatively study the human response to music in its many forms. It's a research program that relies on a wide array of approaches, employing techniques from the study of perception and cognition to those of neurobiology and neuroimaging, with additional insights from psychophysics, evolutionary psychology and animal studies.
- North America > Canada > Quebec > Montreal (0.14)
- North America > United States > New York (0.05)
- Oceania > Papua New Guinea (0.04)
- (3 more...)
- Health & Medicine (0.91)
- Media > Music (0.47)
Heroes of Machine Learning - Top Experts & researchers you should follow
What a time this is to be working in the machine learning field! The last few years have been a dream run for anyone associated with machine learning as there have been a slew of developments and breakthroughs at an unprecedented pace. There's just one thing to keep in mind here – these breakthroughs did not happen overnight. It took years and in some cases, decades, of hard work and persistence. We are used to working with established machine learning algorithms like neural networks and random forest (and so on). We tend to lose sight of the effort it took to make these algorithms mainstream. To actually create them from scratch. The people who lay the groundwork for us – those are the true heroes of machine learning.
- North America > Canada > Ontario > Toronto (0.14)
- North America > United States > California > Alameda County > Berkeley (0.05)
- North America > Canada > Quebec > Montreal (0.05)
- (6 more...)
- Health & Medicine (1.00)
- Education > Educational Setting > Online (0.96)
- Education > Educational Technology > Educational Software > Computer Based Training (0.47)
Agency plus automation: Designing artificial intelligence into interactive systems
Much contemporary rhetoric regards the prospects and pitfalls of using artificial intelligence techniques to automate an increasing range of tasks, especially those once considered the purview of people alone. These accounts are often wildly optimistic, understating outstanding challenges while turning a blind eye to the human labor that undergirds and sustains ostensibly "automated" services. This long-standing focus on purely automated methods unnecessarily cedes a promising design space: one in which computational assistance augments and enriches, rather than replaces, people's intellectual work. This tension between human agency and machine automation poses vital challenges for design and engineering. In this work, we consider the design of systems that enable rich, adaptive interaction between people and algorithms. We seek to balance the often-complementary strengths and weaknesses of each, while promoting human control and skillful action. We share case studies of interactive systems we have developed in three arenas--data wrangling, exploratory analysis, and natural language translation--that integrate proactive computational support into interactive systems. To improve outcomes and support learning by both people and machines, we describe the use of shared representations of tasks augmented with predictive models of human capabilities and actions. We conclude with a discussion of future prospects and scientific frontiers for intelligence augmentation research. Although sharing overlapping origins in midcentury computer science, research programs in intelligence augmentation (IA; using computers to extend people's ability to process information and reason about complex problems) and artificial intelligence (AI; developing computational methods for perception, reasoning, and action) have to date charted largely separate trajectories.
- North America > United States > District of Columbia > Washington (0.04)
- North America > United States > Alabama (0.04)
- North America > Canada > Alberta > Census Division No. 9 > Clearwater County (0.04)
- (2 more...)
- Health & Medicine (0.49)
- Transportation > Air (0.46)